Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 20
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
RNA Biol ; 19(1): 1208-1227, 2022 01.
Artigo em Inglês | MEDLINE | ID: mdl-36384383

RESUMO

This study investigates the importance of the structural context in the formation of a type I/II A-minor motif. This very frequent structural motif has been shown to be important in the spatial folding of RNA molecules. We developed an automated method to classify A-minor motif occurrences according to their 3D context similarities, and we used a graph approach to represent both the structural A-minor motif occurrences and their classes at different scales. This approach leads us to uncover new subclasses of A-minor motif occurrences according to their local 3D similarities. The majority of classes are composed of homologous occurrences, but some of them are composed of non-homologous occurrences. The different classifications we obtain allow us to better understand the importance of the context in the formation of A-minor motifs. In a second step, we investigate how much knowledge of the context around an A-minor motif can help to infer its presence (and position). More specifically, we want to determine what kind of information, contained in the structural context, can be useful to characterize and predict A-minor motifs. We show that, for some A-minor motifs, the topology combined with a sequence signal is sufficient to predict the presence and the position of an A-minor motif occurrence. In most other cases, these signals are not sufficient for predicting the A-minor motif, however we show that they are good signals for this purpose. All the classification and prediction pipelines rely on automated processes, for which we describe the underlying algorithms and parameters.


Assuntos
Imageamento Tridimensional , RNA , Algoritmos , Valor Preditivo dos Testes , Humanos , RNA/química
2.
PLoS Comput Biol ; 17(5): e1008990, 2021 05.
Artigo em Inglês | MEDLINE | ID: mdl-34048427

RESUMO

RNA tertiary structure is crucial to its many non-coding molecular functions. RNA architecture is shaped by its secondary structure composed of stems, stacked canonical base pairs, enclosing loops. While stems are precisely captured by free-energy models, loops composed of non-canonical base pairs are not. Nor are distant interactions linking together those secondary structure elements (SSEs). Databases of conserved 3D geometries (a.k.a. modules) not captured by energetic models are leveraged for structure prediction and design, but the computational complexity has limited their study to local elements, loops. Representing the RNA structure as a graph has recently allowed to expend this work to pairs of SSEs, uncovering a hierarchical organization of these 3D modules, at great computational cost. Systematically capturing recurrent patterns on a large scale is a main challenge in the study of RNA structures. In this paper, we present an efficient algorithm to compute maximal isomorphisms in edge colored graphs. We extend this algorithm to a framework well suited to identify RNA modules, and fast enough to considerably generalize previous approaches. To exhibit the versatility of our framework, we first reproduce results identifying all common modules spanning more than 2 SSEs, in a few hours instead of weeks. The efficiency of our new algorithm is demonstrated by computing the maximal modules between any pair of entire RNA in the non-redundant corpus of known RNA 3D structures. We observe that the biggest modules our method uncovers compose large shared sub-structure spanning hundreds of nucleotides and base pairs between the ribosomes of Thermus thermophilus, Escherichia Coli, and Pseudomonas aeruginosa.


Assuntos
Conformação de Ácido Nucleico , RNA/química , Algoritmos , Pareamento de Bases , Biologia Computacional/métodos
3.
BMC Bioinformatics ; 20(1): 19, 2019 Jan 10.
Artigo em Inglês | MEDLINE | ID: mdl-30630411

RESUMO

BACKGROUND: In systems biology, there is an acute need for integrative approaches in heterogeneous network mining in order to exploit the continuous flux of genomic data. Simultaneous analysis of the metabolic pathways and genomic context of a given species leads to the identification of patterns consisting in reaction chains catalyzed by products of neighboring genes. Similar such patterns across several species can reveal their mode of conservation throughout the tree of life. RESULTS: We present CoMetGeNe (COnserved METabolic and GEnomic NEighborhoods), a novel method that identifies metabolic and genomic patterns consisting in maximal trails of reactions being catalyzed by products of neighboring genes. Patterns determined by CoMetGeNe in one species are subsequently employed in order to reflect their degree of conservation across multiple prokaryotic species. These interspecies comparisons help to improve genome annotation and can reveal putative alternative metabolic routes as well as unexpected gene ordering occurrences. CONCLUSIONS: CoMetGeNe is an exploratory tool at both the genomic and the metabolic levels, leading to insights into the conservation of functionally related clusters of neighboring enzyme-coding genes. The open-source CoMetGeNe pipeline is freely available at https://cometgene.lri.fr .


Assuntos
Bactérias/genética , Bactérias/metabolismo , Biologia Computacional/métodos , Genoma Bacteriano , Genômica/métodos , Redes e Vias Metabólicas/genética , Software , Bactérias/classificação , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Mineração de Dados , Ordem dos Genes , Especificidade da Espécie , Biologia de Sistemas
4.
Nucleic Acids Res ; 46(8): 3841-3851, 2018 05 04.
Artigo em Inglês | MEDLINE | ID: mdl-29608773

RESUMO

The wealth of the combinatorics of nucleotide base pairs enables RNA molecules to assemble into sophisticated interaction networks, which are used to create complex 3D substructures. These interaction networks are essential to shape the 3D architecture of the molecule, and also to provide the key elements to carry molecular functions such as protein or ligand binding. They are made of organised sets of long-range tertiary interactions which connect distinct secondary structure elements in 3D structures. Here, we present a de novo data-driven approach to extract automatically from large data sets of full RNA 3D structures the recurrent interaction networks (RINs). Our methodology enables us for the first time to detect the interaction networks connecting distinct components of the RNA structure, highlighting their diversity and conservation through non-related functional RNAs. We use a graphical model to perform pairwise comparisons of all RNA structures available and to extract RINs and modules. Our analysis yields a complete catalog of RNA 3D structures available in the Protein Data Bank and reveals the intricate hierarchical organization of the RNA interaction networks and modules. We assembled our results in an online database (http://carnaval.lri.fr) which will be regularly updated. Within the site, a tool allows users with a novel RNA structure to detect automatically whether the novel structure contains previously observed RINs.


Assuntos
Bases de Dados de Ácidos Nucleicos/estatística & dados numéricos , Conformação de Ácido Nucleico , RNA/química , Algoritmos , Pareamento de Bases , Biologia Computacional/métodos , Mineração de Dados/métodos , Bases de Dados de Proteínas/estatística & dados numéricos , Modelos Moleculares , Dobramento de RNA , Software
5.
Bioinformatics ; 33(16): 2479-2486, 2017 Aug 15.
Artigo em Inglês | MEDLINE | ID: mdl-28398456

RESUMO

MOTIVATION: Predicting the 3D structure of RNA molecules is a key feature towards predicting their functions. Methods which work at atomic or nucleotide level are not suitable for large molecules. In these cases, coarse-grained prediction methods aim to predict a shape which could be refined later by using more precise methods on smaller parts of the molecule. RESULTS: We developed a complete method for sampling 3D RNA structure at a coarse-grained model, taking a secondary structure as input. One of the novelties of our method is that a second step extracts two best possible structures close to the native, from a set of possible structures. Although our method benefits from the first version of GARN, some of the main features on GARN2 are very different. GARN2 is much faster than the previous version and than the well-known methods of the state-of-art. Our experiments show that GARN2 can also provide better structures than the other state-of-the-art methods. AVAILABILITY AND IMPLEMENTATION: GARN2 is written in Java. It is freely distributed and available at http://garn.lri.fr/. CONTACT: melanie.boudard@lri.fr or johanne.cohen@lri.fr. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional/métodos , Modelos Moleculares , Conformação de Ácido Nucleico , RNA/química , Software , Algoritmos , RNA/metabolismo , Análise de Sequência de RNA/métodos
6.
PLoS One ; 10(8): e0136444, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-26313379

RESUMO

Cellular processes involve large numbers of RNA molecules. The functions of these RNA molecules and their binding to molecular machines are highly dependent on their 3D structures. One of the key challenges in RNA structure prediction and modeling is predicting the spatial arrangement of the various structural elements of RNA. As RNA folding is generally hierarchical, methods involving coarse-grained models hold great promise for this purpose. We present here a novel coarse-grained method for sampling, based on game theory and knowledge-based potentials. This strategy, GARN (Game Algorithm for RNa sampling), is often much faster than previously described techniques and generates large sets of solutions closely resembling the native structure. GARN is thus a suitable starting point for the molecular modeling of large RNAs, particularly those with experimental constraints. GARN is available from: http://garn.lri.fr/.


Assuntos
Algoritmos , Teoria dos Jogos , Bases de Conhecimento , Modelos Moleculares , Dobramento de RNA , RNA/química
7.
Nat Struct Mol Biol ; 21(11): 997-1005, 2014 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-25326705

RESUMO

The U2AF heterodimer has been well studied for its role in defining functional 3' splice sites in pre-mRNA splicing, but many fundamental questions still remain unaddressed regarding the function of U2AF in mammalian genomes. Through genome-wide analysis of U2AF-RNA interactions, we report that U2AF has the capacity to directly define ~88% of functional 3' splice sites in the human genome, but numerous U2AF binding events also occur in intronic locations. Mechanistic dissection reveals that upstream intronic binding events interfere with the immediate downstream 3' splice site associated either with the alternative exon, to cause exon skipping, or with the competing constitutive exon, to induce exon inclusion. We further demonstrate partial functional impairment with leukemia-associated mutations in U2AF35, but not U2AF65, in regulated splicing. These findings reveal the genomic function and regulatory mechanism of U2AF in both normal and disease states.


Assuntos
Processamento Alternativo , Genoma Humano , Proteínas Nucleares/metabolismo , Sítios de Splice de RNA , Ribonucleoproteínas/metabolismo , Sequência de Bases , Sítios de Ligação , Éxons , Células HeLa , Humanos , Íntrons , Dados de Sequência Molecular , Mutação , Proteínas Nucleares/genética , Ligação Proteica , Ribonucleoproteínas/genética , Fator de Processamento U2AF
8.
BMC Genomics ; 15 Suppl 6: S16, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25573073

RESUMO

BACKGROUND: In comparative genomics, orthologs are used to transfer annotation from genes already characterized to newly sequenced genomes. Many methods have been developed for finding orthologs in sets of genomes. However, the application of different methods on the same proteome set can lead to distinct orthology predictions. METHODS: We developed a method based on a meta-approach that is able to combine the results of several methods for orthologous group prediction. The purpose of this method is to produce better quality results by using the overlapping results obtained from several individual orthologous gene prediction procedures. Our method proceeds in two steps. The first aims to construct seeds for groups of orthologous genes; these seeds correspond to the exact overlaps between the results of all or several methods. In the second step, these seed groups are expanded by using HMM profiles. RESULTS: We evaluated our method on two standard reference benchmarks, OrthoBench and Orthology Benchmark Service. Our method presents a higher level of accurately predicted groups than the individual input methods of orthologous group prediction. Moreover, our method increases the number of annotated orthologous pairs without decreasing the annotation quality compared to twelve state-of-the-art methods. CONCLUSIONS: The meta-approach based method appears to be a reliable procedure for predicting orthologous groups. Since a large number of methods for predicting groups of orthologous genes exist, it is quite conceivable to apply this meta-approach to several combinations of different methods.


Assuntos
Biologia Computacional/métodos , Genômica/métodos , Anotação de Sequência Molecular/métodos , Software , Evolução Molecular , Reprodutibilidade dos Testes
9.
Artigo em Inglês | MEDLINE | ID: mdl-23702555

RESUMO

We present a new approach for the prediction of the coarse-grain 3D structure of RNA molecules. We model a molecule as being made of helices and junctions. Those junctions are classified into topological families that determine their preferred 3D shapes. All the parts of the molecule are then allowed to establish long-distance contacts that induce a 3D folding of the molecule. An algorithm relying on game theory is proposed to discover such long-distance contacts that allow the molecule to reach a Nash equilibrium. As reported by our experiments, this approach allows one to predict the global shape of large molecules of several hundreds of nucleotides that are out of reach of the state-of-the-art methods.


Assuntos
Biologia Computacional/métodos , Teoria dos Jogos , Modelos Químicos , Conformação de Ácido Nucleico , RNA/química , Algoritmos , Cristalização , Modelos Moleculares , RNA/genética , RNA/metabolismo , Software
10.
Adv Bioinformatics ; 2012: 893048, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22675348

RESUMO

The pairwise comparison of RNA secondary structures is a fundamental problem, with direct application in mining databases for annotating putative noncoding RNA candidates in newly sequenced genomes. An increasing number of software tools are available for comparing RNA secondary structures, based on different models (such as ordered trees or forests, arc annotated sequences, and multilevel trees) and computational principles (edit distance, alignment). We describe here the website BRASERO that offers tools for evaluating such software tools on real and synthetic datasets.

11.
Comput Biol Chem ; 37: 1-5, 2012 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-22326420

RESUMO

We present an algorithm for automatically predicting the topological family of any RNA three-way junction, given only the information from the secondary structure: the sequence and the Watson-Crick pairings. The parameters of the algorithm have been determined on a data set of 33 three-way junctions whose 3D conformation is known. We applied the algorithm on 53 other junctions and compared the predictions to the real shape of those junctions. We show that the correct answer is selected out of nine possible configurations 64% of the time. Additionally, these results are noticeably improved if homology information is used. The resulting software, Cartaj, is available online and downloadable (with source) at: http://cartaj.lri.fr.


Assuntos
Algoritmos , RNA/química , Homologia de Sequência do Ácido Nucleico , Software , Pareamento de Bases , Bases de Dados Factuais , Modelos Moleculares , Dobramento de RNA , Alinhamento de Sequência , Análise de Sequência de RNA
12.
J Comput Biol ; 18(10): 1339-51, 2011 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-21548808

RESUMO

In 2004, Condon and coauthors gave a hierarchical classification of exact RNA structure prediction algorithms according to the generality of structure classes that they handle. We complete this classification by adding two recent prediction algorithms. More importantly, we precisely quantify the hierarchy by giving closed or asymptotic formulas for the theoretical number of structures of given size n in all the classes but one. This allows us to assess the tradeoff between the expressiveness and the computational complexity of RNA structure prediction algorithms.


Assuntos
Algoritmos , Biologia Computacional/métodos , Modelos Moleculares , RNA/química , Simulação por Computador , Conformação de Ácido Nucleico , Probabilidade , Análise de Sequência de RNA
13.
FEBS Lett ; 584(21): 4401-7, 2010 Nov 05.
Artigo em Inglês | MEDLINE | ID: mdl-20888818

RESUMO

In this report, we studied the effect of RNA structures on the activity of exonic splicing enhancers on the SMN1 minigene model by engineering known ESEs into different positions of stable hairpins. We found that as short as 7-bp stem is sufficient to abolish the enhancer activity. When placing ESEs in the loop region, AG-rich ESEs are fully active, but a UCG-rich ESE is not because of additional structural constraints. ESEs placed adjacent to the 3' end of the hairpin structure display high enhancer activity, regardless of their sequence identities. These rules explain the suppression of multiple ESEs by point mutations that result in a stable RNA structure, and provide an additional mechanism for the C6T mutation in SMN2.


Assuntos
Éxons/genética , Conformação de Ácido Nucleico , Splicing de RNA/genética , RNA/química , RNA/genética , Sequências Reguladoras de Ácido Ribonucleico/genética , Pareamento de Bases , Sequência de Bases , Células HeLa , Humanos , Sequências Repetidas Invertidas , Estabilidade de RNA , Proteína 1 de Sobrevivência do Neurônio Motor/genética , Temperatura
14.
Artigo em Inglês | MEDLINE | ID: mdl-20431150

RESUMO

We describe a theoretical unifying framework to express the comparison of RNA structures, which we call alignment hierarchy. This framework relies on the definition of common supersequences for arc-annotated sequences and encompasses the main existing models for RNA structure comparison based on trees and arc-annotated sequences with a variety of edit operations. It also gives rise to edit models that have not been studied yet. We provide a thorough analysis of the alignment hierarchy, including a new polynomial-time algorithm and an NP-completeness proof. The polynomial-time algorithm involves biologically relevant edit operations such as pairing or unpairing nucleotides. It has been implemented in a software, called gardenia, which is available at the Web server http://bioinfo.lifl.fr/RNA/gardenia.


Assuntos
Biologia Computacional/métodos , Modelos Genéticos , RNA/química , Algoritmos , Sequência de Bases , Genes Bacterianos/genética , Dados de Sequência Molecular , Conformação de Ácido Nucleico , Alinhamento de Sequência , Software
15.
Bioinformatics ; 25(15): 1974-5, 2009 Aug 01.
Artigo em Inglês | MEDLINE | ID: mdl-19398448

RESUMO

DESCRIPTION: VARNA is a tool for the automated drawing, visualization and annotation of the secondary structure of RNA, designed as a companion software for web servers and databases. FEATURES: VARNA implements four drawing algorithms, supports input/output using the classic formats dbn, ct, bpseq and RNAML and exports the drawing as five picture formats, either pixel-based (JPEG, PNG) or vector-based (SVG, EPS and XFIG). It also allows manual modification and structural annotation of the resulting drawing using either an interactive point and click approach, within a web server or through command-line arguments. AVAILABILITY: VARNA is a free software, released under the terms of the GPLv3.0 license and available at http://varna.lri.fr. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional/métodos , RNA/química , Software , Gráficos por Computador , Bases de Dados Factuais , Armazenamento e Recuperação da Informação , Conformação de Ácido Nucleico , Interface Usuário-Computador
16.
RNA ; 14(12): 2489-97, 2008 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-18957493

RESUMO

We used a novel graph-based approach to extract RNA tertiary motifs. We cataloged them all and clustered them using an innovative graph similarity measure. We applied our method to three widely studied structures: Haloarcula marismortui 50S (H.m 50S), Escherichia coli 50S (E. coli 50S), and Thermus thermophilus 16S (T.th 16S) RNAs. We identified 10 known motifs without any prior knowledge of their shapes or positions. We additionally identified four putative new motifs.


Assuntos
RNA/química , Análise por Conglomerados , Escherichia coli/química , Haloarcula marismortui/química , Conformação de Ácido Nucleico , RNA Arqueal/química , RNA Bacteriano/química , Thermus thermophilus/química
17.
FEBS Lett ; 581(27): 5282-8, 2007 Nov 13.
Artigo em Inglês | MEDLINE | ID: mdl-17967457

RESUMO

Pyrrolysyl-tRNA synthetase and its cognate suppressor tRNA(Pyl) mediate pyrrolysine (Pyl) insertion at in frame UAG codons. The presence of an RNA hairpin structure named Pyl insertion structure (PYLIS) downstream of the suppression site has been shown to stimulate the insertion of Pyl in archaea. We study here the impact of the presence of PYLIS on the level of Pyl and the Pyl analog N-epsilon-cyclopentyloxycarbonyl-l-lysine (Cyc) incorporation using a quantitative lacZ-luc tandem reporter system in an Escherichia coli context. We show that PYLIS has no effect on the level of neither Pyl nor Cyc incorporation. Exogenously supplying our reporter system with d-ornithine significantly increases suppression efficiency, indicating that d-ornithine is a direct precursor to Pyl.


Assuntos
Escherichia coli/genética , Código Genético , Lisina/análogos & derivados , Proteínas Arqueais/genética , Sequência de Bases , Primers do DNA/genética , DNA Bacteriano/genética , Escherichia coli/metabolismo , Genes Arqueais , Genes Bacterianos , Lisina/genética , Lisina/metabolismo , Methanosarcina barkeri/genética , Metiltransferases/genética , Dados de Sequência Molecular , Conformação de Ácido Nucleico , Ornitina/metabolismo , RNA Bacteriano/química , RNA Bacteriano/genética , Homologia de Sequência do Ácido Nucleico
18.
Bioinformatics ; 22(12): 1534-5, 2006 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-16574695

RESUMO

SUMMARY: GenRGenS is a software tool dedicated to randomly generating genomic sequences and structures. It handles several classes of models useful for sequence analysis, such as Markov chains, hidden Markov models, weighted context-free grammars, regular expressions and PROSITE expressions. GenRGenS is the only program that can handle weighted context-free grammars, thus allowing the user to model and to generate structured objects (such as RNA secondary structures) of any given desired size. GenRGenS also allows the user to combine several of these different models at the same time.


Assuntos
Biologia Computacional/métodos , Genoma , Software , Simulação por Computador , Cadeias de Markov , Modelos Estatísticos , Conformação de Ácido Nucleico , Fases de Leitura Aberta , Linguagens de Programação , RNA/química
19.
J Comput Biol ; 12(1): 33-47, 2005.
Artigo em Inglês | MEDLINE | ID: mdl-15725732

RESUMO

Pairwise sequence alignments aim to decide whether two sequences are related and, if so, to exhibit their related domains. Recent works have pointed out that a significant number of true homologous sequences are missed when using classical comparison algorithms. This is the case when two homologous sequences share several little blocks of homology, too small to lead to a significant score. On the other hand, classical alignment algorithms, when detecting homologies, may fail to recognize all the significant biological signals. The aim of the paper is to give a solution to these two problems. We propose a new scoring method which tends to increase the score of an alignment when "blocks" are detected. This so-called Block-Scoring algorithm, which makes use of dynamic programming, is worth being used as a complementary tool to classical exact alignments methods. We validate our approach by applying it on a large set of biological data. Finally, we give a limit theorem for the score statistics of the algorithm.


Assuntos
Algoritmos , Biologia Computacional/métodos , RNA/química , Alinhamento de Sequência , Homologia de Sequência do Ácido Nucleico , Sequência de Bases , Simulação por Computador , Dados de Sequência Molecular
20.
Bioinformatics ; 19(3): 327-35, 2003 Feb 12.
Artigo em Inglês | MEDLINE | ID: mdl-12584117

RESUMO

MOTIVATION: Unconventional decoding events are now well acknowledged, but not yet well formalized. In this study, we present a bioinformatics analysis of eukaryotic -1 frameshifting, in order to model this event. RESULTS: A consensus model has already been established for -1 frameshifting sites. Our purpose here is to provide new constraints which make the model more precise. We show how a machine learning approach can be used to refine the current model. We identify new properties that may be involved in frameshifting. Each of the properties found was experimentally validated. Initially, we identify features of the overall model that are to be simultaneously satisfied. We then focus on the following two components: the spacer and the slippery sequence. As a main result, we point out that the identity of the primary structure of the so-called spacer is of great importance. AVAILABILITY: Sequences of the oligonucleotides in the functional tests are available at http://www.igmors.u-psud.fr/rousset/bioinformatics/.


Assuntos
Mudança da Fase de Leitura do Gene Ribossômico/genética , Regulação da Expressão Gênica , Modelos Genéticos , Análise de Sequência de RNA/métodos , Vírus/genética , Algoritmos , Animais , Inteligência Artificial , Sequência de Bases , Aves , Simulação por Computador , DNA Espaçador Ribossômico/genética , Células Eucarióticas , Regulação Viral da Expressão Gênica , Haplorrinos , Humanos , Dados de Sequência Molecular
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...